home *** CD-ROM | disk | FTP | other *** search
Text File | 1993-10-01 | 125.5 KB | 2,893 lines |
-
-
-
-
-
-
-
-
-
-
-
- TexaSoft's
- USING KWIKSTAT 3.3 (Condensed Manual)
- (C)Copyright 1991,1992 Alan C. Elliott
-
-
- Winner of the 1992
- SIA Award for
- "Best Math or Enginnering Program"
-
-
-
- For additional information on this product, contact TexaSoft, P.O. Box
- 1169, Cedar Hill, Texas 75104 (214) 291-2115, Fax: (214) 291-3400,
- Compuserve:70721,3145.
-
- ACKNOWLEDGEMENTS
-
- This manual was written by Alan Elliott and Marcia Stoesz.
-
- ALL RIGHTS RESERVED
-
- Note:There is important information in the file LATENEWS.DOC. To view
- this file enter the command TXVIEW LATENEWS.DOC. The file is on disk 3
- in the 5.25 inch version.
-
-
- NOTE:TO PRINT AN ORDER FORM, CHOOSE THE "ABOUT" OPTION IN THE HELP
- MENU, YOU WILL BE GIVEN THE OPTION TO PRINT AN ORDER FORM. OR, PRINT
- THE FILE NAMED KSORDER.TXT.
-
-
-
-
-
-
-
-
-
-
-
-
-
-
- --------------------------------------------------------------------
- Become a Registered User, Print Order Form
-
-
- 1
-
-
-
- KWIKSTAT CONDENSED MANUAL Version 3.3
- --------------------------------------------------------------------
-
-
- PART I AN OVERVIEW OF KWIKSTAT
-
- KWIKSTAT is a statistical data analysis program. It was designed by
- professional statistical consultants and researchers to allow you to
- quickly and easily use the most commonly needed statistical data
- analysis procedures.
-
- WHY USE KWIKSTAT?
-
- KWIKSTAT can help you:
-
- 1. decide the appropriate data analysis procedure to use,
-
- 2. enter data or use data already in popular formats such as dBASE,
- 1-2-3 or ASCII,
-
- 3. provide a complete analysis in one pass so the user does not have
- to run multiple programs to perform a single analysis, and
-
- 4. provide interpretations of the results to assist the user in making
- decisions based on the outcomes of the analyses.
-
- KWIKSTAT REQUIREMENTS
-
- KWIKSTAT is designed to run on IBM PC and 100% compatible computers
- including the IBM PS/2 computers. It requires PC-DOS or MS-DOS version
- 3.0 or higher. Your computer should contain at least 364K or more of
- free RAM memory. KWIKSTAT graphics require a CGA, EGA, VGA or Hercules
- compatible monitor. Many printers are supported. A mouse is optional.
-
- INSTALLATION
-
- To install on a hard disk, place the KWIKSTAT distribution disk in the
- A: drive and enter the command A:INSTALL and follow the instructions
- on the screen. To use KWIKSTAT on a two-disk 360K machine, place
- disk three (3) in the A: drive and type the command FTIPS.
-
- NOTE:This manual is a condensed version of the printed and illustrated
- version of the manual you receive when you register. Please ignore
- references to figures.
-
-
-
-
-
-
-
- --------------------------------------------------------------------
- Become a Registered User, Print Order Form
-
-
- 2
-
-
-
- KWIKSTAT CONDENSED MANUAL Version 3.3
- --------------------------------------------------------------------
-
-
- USING THE KWIKSTAT MENU
-
- Once you installed and setup KWIKSTAT, to begin the program, enter the
- the KS command from the DOS prompt.
-
- The main menu contains three options, Data, Analyze and Helps. Using
- the right and left arrow keys on the cursor pad, you can move the menu
- selection to one of the other two menu bar options. Pressing the right
- arrow key once, moves the menu bar option from Data to Analyze. The
- Data pull-down menu vanishes and the Analyze pull-down menu appears.
- Pressing the left arrow key moves the selection back to the Data menu.
- Or, point to a menu option with the mouse and click.
-
- To select options from an extended menu (pulled-down), use the up and
- down arrow keys on the cursor pad to highlight the option you desire,
- then press the Enter key. Or, press the first letter of the option
- name. If you are using a mouse, point to the selection with the mouse
- pointer and click.
-
- USING THE ANALYZE MENU
-
- The KWIKSTAT Analyze menu allows you to choose which analysis module
- to run. Usually, you first open a database in the Data menu, then
- choose one of the options in the Analyze menu. For example, to
- calculate descriptive statistics on information in the EXAMPLE
- database, first open the EXAMPLE database by choosing Open a database
- to use in the Data menu and selecting EXAMPLE from the database list.
- Then press the right arrow key once to open the Analyze pull-down
- menu. Then choose the Descriptive Statistics and Graphs option from
- the Analyze menu. This begins the Descriptive Statistics and Graphics
- module, which contains a menu of procedure options.
-
- USING THE KWIKSTAT HELP SYSTEM
-
- The KWIKSTAT Help system contains two levels of help. From the "Helps"
- pull-down menu, you can choose the Help On Using The Program option
- or the Decide What Analysis To Use Option. Other options on this menu
- are About Kwikstat, which gives copyright information about the
- program and allows you to print an order form. The GO to DOS, Return
- with Exit (Shell) option allows you to temporarily return to the DOS
- prompt. The Change Setup option allows you to setup KWIKSTAT fror your
- computer.
-
-
-
-
-
- --------------------------------------------------------------------
- Become a Registered User, Print Order Form
-
-
- 3
-
-
-
- KWIKSTAT CONDENSED MANUAL Version 3.3
- --------------------------------------------------------------------
-
-
- TUTORIAL:TRY THIS EXAMPLE
-
- This short tutorial will give you a feeling for how to use KWIKSTAT. It
- is not intended to be thorough, but simply to lead you though a common
- procedure. It will assume you are using KWIKSTAT on a hard disk. To begin
- KWIKSTAT, you must first be in the \KWIKSTAT directory on your hard disk.
- Use the CD (Change Directory) command from the DOS prompt to change to
- the \KWIKSTAT directory by using the command:
-
- CD\KWIKSTAT
-
- Once in the \KWIKSTAT directory, begin KWIKSTAT with the KS command:
-
- KS
-
- The Data menu that should appear is the same as was illustrated
- previously in Figure 1.1. (If the Analyze menu appears, press the left
- arrow key once to open the Data menu.) The Data pull-down menu is
- extended, as described above in the section "Using the KWIKSTAT Menu
- System".
-
- ACCESSING THE KWIKSTAT HELP SCREENS
-
- To examine the KWIKSTAT HELP menu, press the F1 function key. You can
- think about the HELP procedure as a book, with screens instead of
- pages. It is really a condensation of the manual. The KWIKSTAT Program
- Help screen menu was illustrated in Figure 1.3. To look at a
- particular topic, enter the screen number you desire. For example,
- look at screen 7. Type the number 7, and press Enter:
-
- Enter SCREEN NUMBER or Enter to Cancel:7 (You enter the 7.)
-
- If you enter a 7, KWIKSTAT displays screen 7. (If you press Enter without
- first typing a number, the Help is canceled, and you will return to the
- menu.) Once you have displayed screen 7, to move to screen number 8,
- press Enter. To go back to the menu, type the "M" key. To exit the HELP
- module, press the Enter key from the main Help menu or the Esc key from a
- help screen. Press Enter now. This takes you back to the KWIKSTAT Data
- pull-down menu. Every module has the help screens available.
-
- The KWIKSTAT "Decision" help screen is available from the Helps pull-down
- menu. To decide what kind of descriptive analysis to use for a single
- variable, choose help screen option 1 by typing a 1, then press Enter.
- A screen describing descriptive analysis options is displayed (as in
- Figure 1.5). For example, if your data is near normal (quantitative),
- use the B (detailed statistics), C (summary statistics) and/or E
- (histogram) options in the Statistics module. Pressing Esc will end
- --------------------------------------------------------------------
- Become a Registered User, Print Order Form
-
-
- 4
-
-
-
- KWIKSTAT CONDENSED MANUAL Version 3.3
- --------------------------------------------------------------------
-
-
- the help system and return you to the main menu.
-
- EXAMPLE OF DESCRIPTIVE STATISTICS
-
- This example will use data already stored in a dBASE ".DBF" file
- named EXAMPLE currently on the KWIKSTAT disk. To open this database,
- extend the Data pull-down menu. Then select the Open a Database
- option. A "PICK" menu will appear. That is, a list of database names
- will be displayed, and you can pick one of the names from this menu.
- Choose the EXAMPLE database. (If the EXAMPLE database does not appear
- on the list of databases, you may not have installed the program
- correctly.)
-
- Once the database is opened, a notice at the bottom of the screen tells
- you that the database named EXAMPLE is open, and it contains 50 records.
- Press the "L" key once to choose the List the Contents of a database
- option from the menu. This will list the contents of EXAMPLE database to
- the screen. Press Enter several times to list the entire database to the
- screen. When the list is finished, you will return to the Data pull-down
- menu.
-
- Extend the Analyze pull-down menu. Choose the Descriptive Statistics
- and Graphs option from the Analyze menu. KWIKSTAT now switches to the
- Descriptive module (which may take a few seconds). Soon, you will see
- the Descriptive Statistics menu, as illustrated in Figure 1.6.
-
- From the Descriptive Statistics menu, choose Detailed statistics on a
- single variable. The program now displays the variables available for
- analysis from the database.
-
- Choose variable number 2, "AGE". Before the statistics for this
- variable are displayed, two options are presented. First, you are
- prompted you with the question:
-
- Specify Confidence Interval level (.5 to .99)
- (Default is .95)
-
- For this example, press Enter to accept the default, which tells the
- program to display a 95% confidence interval for the data on the
- statistics screen. Next, a second option appears with the prompt:
-
- Default for percentiles is Tukey 5 Number Summary
- Specify your own percentiles to calculate < yes > < No >
-
- When a Yes/No question appears on the screen, notice that the Y or the
- N will be uppercase. This means that if you press Enter without
- entering a Y or an N, the uppercase option is the default (No). For
- --------------------------------------------------------------------
- Become a Registered User, Print Order Form
-
-
- 5
-
-
-
- KWIKSTAT CONDENSED MANUAL Version 3.3
- --------------------------------------------------------------------
-
-
- this example, to choose No to the question, just press Enter. Or point
- to No with the mouse and click. The program will now perform a series
- of calculations on the data, and will produce a screen of descriptive
- statistics, and a box plot of the data. The results are illustrated in
- Figure 1.8.
-
- Notice that this screen is different from previous screens. The
- information on this screen is displayed in graphics mode (if you have a
- graphics monitor). Normally, information on the screen is in "text" mode.
- If you are using a color monitor, a text mode screen will display text in
- the colors you selected in the setup procedure. When graphs are displayed
- on the screen, the program must use a graphics screen mode. In graphics
- mode, some graphs appear only in black and white, although some graphs
- will appear in color. On all graphic screens in KWIKSTAT, a menu will
- appear at the bottom of the screen for a few seconds, then disappear.
- This allows you to capture or print the screen without the menu appearing
- on your printout. To bring the menu back, press the spacebar once. The
- menu options are still available even when the menu is not visible. The
- menus differ according to your setup and particular options available for
- the graphic display, but most graphic menus will include the following
- options:
-
- Esc:ExitR:Replot P:Print
-
- That is, press Esc to end the display, press R to replot (choose other
- display options) and P to print the graphic screen to the printer.
- Depending on your monitor setting, the menu may also contain a "Capture
- PCX" option. This option allows you to capture the graphic screen into a
- PCX type file that you can then use in other programs such as WordPerfect
- or Pagemaker.
-
- If you want a printed copy of this graphics screen, MAKE SURE YOUR
- PRINTER IS TURNED ON, and is ON LINE, and HAS PAPER. Then, press "P"
- (for Print).
-
- To return to the main Descriptives menu, press the Esc key. To end this
- module and return to the main KWIKSTAT menu, press Esc. To end KWIKSTAT
- from the main menu, press Esc again and answer Y to the prompt "End
- KWIKSTAT."
-
- This ends the tutorial. All of the procedures are explained more fully
- later in the manual. However, you may find that after finishing this
- tutorial, you will be able to use most of the KWIKSTAT features
- without any further aid from the manual.
-
-
-
- --------------------------------------------------------------------
- Become a Registered User, Print Order Form
-
-
- 6
-
-
-
- KWIKSTAT CONDENSED MANUAL Version 3.3
- --------------------------------------------------------------------
-
-
- Part II
- Using the KWIKSTAT Database
-
- The KWIKSTAT DATA pull-down menu in the main KWIKSTAT module is used
- to manage your data. From this menu you enter data, change data,
- create new data fields from existing ones, and perform other data
- maintenance tasks. Once your data is in the KWIKSTAT (dBASE-type)
- database, you can access the data from any of the other KWIKSTAT
- modules.
-
- Some of the KWIKSTAT procedures require that you have data in a file
- before you can do an analysis. Other procedures allow you to enter
- information from the keyboard at the time you request the procedure.
- Some procedures give you an option to enter data from a database, or
- from the keyboard.
-
- HOW DATA IS STORED IN KWIKSTAT
-
- A KWIKSTAT database uses the same file format as the dBASE III and dBASE
- IV programs. Therefore, data already stored in a dBASE III or dBASE IV
- file may be read directly into all the KWIKSTAT programs. The only
- exception to this is that KWIKSTAT does not read dBASE MEMO fields.
- Therefore, if your data in dBASE contains memo fields, you may have to
- create a subset of your database before using it in KWIKSTAT. Data from
- other programs can also be used in KWIKSTAT. Refer to the section called
- "Entering Data into the Database."
-
- The menu for the data options appears as the DATA option on the main
- KWIKSTAT menu. The following information describes how to use the options
- in this menu to create, manipulate and modify a database for use in
- KWIKSTAT.
-
- The Open a database to use option on the DATA menu allows you to access
- information in a dBASE file that you created in KWIKSTAT, in dBASE, or in
- any other program that creates .DBF files. Use this option to choose the
- database that you will be analyzing.
-
- When you choose the OPEN option on the DATA menu, a list of databases
- currently in the default directory will be displayed, as shown in Figure
- 2.2. To select a database, use the up and down arrow keys to highlight
- a database name, then press Enter. Or, point to the entry with the
- mouse pointer and click.
-
- If the database you want to use is not in the current (default)
- directory, you can temporarily change the default directory by
- pressing the F2 function key. Once a database is open, you will see
- its name at the bottom left of the screen, along with the number of
- --------------------------------------------------------------------
- Become a Registered User, Print Order Form
-
-
- 7
-
-
-
- KWIKSTAT CONDENSED MANUAL Version 3.3
- --------------------------------------------------------------------
-
-
- records in the database. You can edit, pack, modify, set missing
- values, subset and list the database using the other options on the
- DATA menu.
-
- DESIGNING AND CREATING A DATABASE
-
- The Create a new database option on the DATA menu is used to create a
- new database. The structure, or layout, of a database must be
- described before you enter your data. You need to give some thought
- to how your database will "look" so it will be in the proper format to
- do the analysis you desire. In the descriptions of statistical
- procedures (Part IV), specific examples are given about how a database
- should be constructed for a particular type of analysis.
-
- Kwikstat allows you to create a new database in two ways:
-
- 1. Choose from a predefined structure or
-
- 2. Create a customized database
-
- Both of these options are discussed in the sections below.
-
- USING A PREDEFINED DATABASE STRUCTURE
-
- You can choose to create a custom database structure (which was the
- only choice for version 3.0 and earlier) or you can choose from a list
- of pre-defined databases that are designed for specific analyses. The
- list below contains examples of some of the pre-defined database
- descriptions. For example, if you need to enter data for an
- independent group t-test, you would choose the option called "For
- independent group t-test or ANOVA." The proper database structure for
- this analysis will be created and then you can enter your data into
- the database.
-
- DEFINE THE FIELDS IN YOUR DATABASE
-
- When you first enter the definition mode, the blinking cursor will be
- in the FIELD NAME area. Enter a name, (must begin with a letter, can
- contain letters, numbers and "_" (underscore) and may be up to 10
- characters long) and press the ENTER key. The name you choose will be
- displayed in all capital letters, and the cursor will move to the next
- area, TYPE.
-
- In the TYPE area, you only need to enter the first character of the
- type (N, C, L or D), - Numeric, Character, Logical or Date, then
- press the ENTER key.
-
- --------------------------------------------------------------------
- Become a Registered User, Print Order Form
-
-
- 8
-
-
-
- KWIKSTAT CONDENSED MANUAL Version 3.3
- --------------------------------------------------------------------
-
-
- WIDTH is the number of characters reserved for the entry. Decimal is
- the number of decimal places (only for numbers). Note that the number
- of decimal places must be at least one less than the width. For
- example, if a number has the format ###.##, the width is 6 (count the
- decimal point), and the number of decimal places is 2. If DATE or
- LOGICAL is entered as type, the program will automatically assign a
- width of 8 or 1 respectively.
-
- Unless your database is big, you might make each field one more
- character wide that you actually need. This provides for
- unanticipated large numbers and facilitates data entry.
-
- Once a complete field description is entered, a next blank field
- description will appear, ready for entry. To end the creation process,
- type Ctrl-END (^END). As long as you have not ended the procedure, you
- may use the cursor keys to back up, and make any corrections. If you
- mess up, end the procedure with Esc and begin again. When you press
- ^End, the following message appears.
-
- Enter Records (data) into the database now
- <yes> <No>
-
- If you want to enter data now, answer "Y" to the question. Otherwise
- answer "N". You can always enter the data later.
-
- LIMITATIONS TO THE KWIKSTAT DATABASE
-
- Maximum of 250 fields.
- Maximum width of a field name is 10 characters.
- Maximum width of a cell is 60 characters (15 for numbers).
- Dates are always 8 characters and logical fields are 1 character wide.
- Memo fields are not supported.
-
- DATABASE AND ANALYSIS EXAMPLES
-
- This section provides you with two examples of using Kwikstat. Please go
- over these examples before creating your own database and performing your
- own analysis. Following these examples will answer a number of questions
- you may have about how to use Kwikstat.
-
- The first example shows you how to create a custom database and calculate
- some simple statistics and a graph. The second example shows you hot to
- use a pre-defined database structure to perform a t-test.
-
-
-
-
- --------------------------------------------------------------------
- Become a Registered User, Print Order Form
-
-
- 9
-
-
-
- KWIKSTAT CONDENSED MANUAL Version 3.3
- --------------------------------------------------------------------
-
-
- EXAMPLE 1
-
- DESCRIPTIVE STATISTICS EXAMPLE
-
- This example shows you how to enter data and perform some simple
- statistics and graphs. It will show you both the spreadsheet and
- database entry screens.
-
- The data that will be used is listed below. The GRADE variable is the
- grade received in the class, AGE is age, SEX is sex, WT is weight and
- SCORE is the score on a pre-test (maximum of 25 points). In database
- language, these variables are called fields.
-
- GRADE AGE SEX WT SCORE
- 1 A 18 M 165 22.3
- 2 B 19 M 145 22.8
- 3 B 17 F 122 22.8
- 4 C 18 M 196 18.5
- 5 B 17 M 188 19.5
- 6 B 18 F 140 23.5
- 7 C 19 F 121 22.6
- 8 B 20 F 112 21.0
- 9 C 19 F 122 20.9
- 10 A 18 M 176 22.5
- 11 B 18 M 165 23.3
- 12 A 19 M 135 21.8
- 13 A 18 F 121 24.8
- 14 C 19 M 186 16.5
- 15 B 17 M 148 18.5
- 16 A 18 F 140 24.5
- 17 B 16 F 101 23.6
- 18 A 21 F 111 20.0
- 19 B 17 F 124 21.9
- 20 B 18 M 176 21.5
-
- Before performing any kind of analysis on this data, you must first enter
- it into a Kwikstat database. The process is:
-
- 1. Create a database
- 2. Enter the data
- 3. Perform an analysis
-
- These processes will be explained in the next few sections.
-
-
-
-
- --------------------------------------------------------------------
- Become a Registered User, Print Order Form
-
-
- 10
-
-
-
- KWIKSTAT CONDENSED MANUAL Version 3.3
- --------------------------------------------------------------------
-
-
- CREATE A DATABASE
-
- When you begin the Kwikstat program, the main Kwikstat menu appears.
- If you have not yet created the database, you must choose the Create a
- new database option, which will lead you through the steps in creating
- a database. This section describes that procedure.
-
- Note: Once a database has been created, you can use the data in it
- again by choosing "Open a database to use" from the Data menu. When
- you choose Create a new database from the Data menu, you will then be
- prompted to enter the name of the database, as shown in figure 2.6.
- You need to enter a name for the database that is a DOS compatible
- file name such as MYDATA.
-
- Once you have entered a filename for the database, you can choose from
- a list of pre-defined database structures, or create your own. In this
- example, you will create your own database structure. From the menu
- shown in figure 2.7, choose the CREATE A CUSTOMIZED DATABASE
- option. For each field (each item of data) in the database, you must
- specify a fieldname, a type a width and optionally the number of
- decimal places. For the data in this example, you will use the
- following information:
-
- Field name Type Width Dec
- GRADE C 2
- AGE N 3
- SEX C 2
- WT N 4
- SCORE N 5 1
-
- The GRADE and SEX variables are of type "C" (Character) and the rest
- of the variables are numbers "N". Notice that the widths defined here
- are actually 1 character wider than actually needed. If you are not
- pressed for space in the database, this will make your listings easier
- to read. Only the SCORE variable requires a decimal value. Enter the
- information about the database structure into the database definition
- screen (see the section,"Define the Fields in Your Database" above) as
- shown in figure 2.9.
-
- When you will be prompted with the question,
-
- Enter records (data) into the database now?
-
- Type a Y to begin entering records into the database.
-
-
-
- --------------------------------------------------------------------
- Become a Registered User, Print Order Form
-
-
- 11
-
-
-
- KWIKSTAT CONDENSED MANUAL Version 3.3
- --------------------------------------------------------------------
-
-
- ENTER THE DATA
-
- When you choose to enter the data in a new database, an entry screen will
- appear listing the names of all of the fields and an area to enter the
- data. Kwikstat includes two types of data entry screen, database type and
- spreadsheet type. In the Setup routine, you chose one of these two entry
- options. The following discussion shows you how to enter data in either
- screen.
-
- TIP: You can toggle between spreadsheet entry mode and database entry
- mode by pressing the F8 (Switch) key.
-
- USING A SPREADSHEET ENTRY SCREEN
-
- The spreadsheet screen, as shown in figure 2.10 looks similar to a
- spreadsheet. If you prefer to use the database entry mode, skip to the
- section titled "Using a database entry screen."
-
- The names of the database fields (Grade, Age, etc.) are listed at the top
- of the screen (columns) and the record numbers are listed down the left
- side of the screen (rows). Since you do not have any records entered into
- the database, the only row displayed is the -ADD- row, which indicates
- that you are adding a new record.
-
- To enter data into the database, begin typing the entry for the first
- field (GRADE). Type an A (upper case), then press Enter. Your cursor
- moves to the next field (AGE). Type 18 and press Enter. Type upper case M
- and press Enter. Continue until you have entered 22.3 in the SCORE field.
- When you press Enter after entering 22.3, a new row appears to allow you
- to enter the second record of information, and your cursor moves to the
- first field of this record. Continue entering information in the
- spreadsheet until all records are entered. If you make a mistake on a
- record, you can use the right or left arrow keys to move your cursor and
- correct the mistake. If you discover that you have made an error in a
- previous record, you can use the Edit mode (described later) to correct
- this entry. When you have finished entering the information in the
- database, your screen will look like figure 2.11.
-
- To end the entry procedure, press the F7 (Exit) key. A message will
- appear on the screen:
-
- Before exiting, do you want to save the
- record number 21
- <yes> <No>
-
- Answer No to this question since you do not want to have a blank record
- in your database.
- --------------------------------------------------------------------
- Become a Registered User, Print Order Form
-
-
- 12
-
-
-
- KWIKSTAT CONDENSED MANUAL Version 3.3
- --------------------------------------------------------------------
-
-
-
- CORRECTING ERRORS IN THE DATABASE
-
- Before returning to the main menu, you can correct errors by pressing the
- F2 key to toggle into Edit mode. The edit screen is similar to the screen
- used to enter data. Use the cursor keys to move to the field to edit, and
- change the value. Exit the edit screen with the F7 (Exit) command.
- If you end up with an extra record in your database, you can erase that
- record while in the Edit mode. To erase a record, place your cursor on
- the record and press F4 (Erase). The record will be permanently removed
- from the database. Exit the edit screen with the F7 (Exit) command.
-
- USING A DATABASE ENTRY SCREEN
-
- This section describes how to enter data using the database entry mode.
- When you begin entering data into a new database, an entry screen for
- record 1 appears on the screen similar to figure 2.12. (Information for
- record 1 already entered).
-
- The database entry screen displays each field name at the left of the
- screen followed by an entry field where you will enter the data for that
- field. For example, when the entry screen first appears, your cursor will
- be in the GRADE field. To enter the information for record 1, type the
- grade value for the first record, A (upper case) and press Enter. Your
- cursor will move to the next field. Type 18 and press Enter. Continue
- until you have entered 22.3 in the SCORE field. When you press Enter
- after entering 22.3, a new entry screen appears for record 2 to allow you
- to enter the second record of information, and you cursor moves to the
- first field of this record. Continue entering information until all
- records are entered. If you make a mistake on a record, you can use the
- arrow keys to move your cursor and correct the mistake. If you discover
- that you have made an error in a previous record, you can use the Edit
- mode (described later) to correct this entry.
-
- ENTRY AND EDIT SCREEN FUNCTION KEY COMMANDS
-
- When you are adding information to the database, there are several
- function key options that you can choose. These options are listed at the
- bottom of the entry screen. To choose an option, press the function key
- related to the option, or point to the option with the mouse and click.
-
- F1 - Displays the Kwikstat Help menu.
- F2 - Toggles between edit mode and append mode.
- F3 - Marks a record for deletion. (Same as ^U.) Also undeletes
- records.
- F4 - Erase the current record permanently from the
- database. (Only in spreadsheet entry mode.)
- --------------------------------------------------------------------
- Become a Registered User, Print Order Form
-
-
- 13
-
-
-
- KWIKSTAT CONDENSED MANUAL Version 3.3
- --------------------------------------------------------------------
-
-
- F5 - Goto a record number.
- F6 - Undo - returns last record changed to it previous values.
- F7 - Exits entry mode and returns you to the main Kwikstat menu.
- F8 - Switches between spreadsheet type entry and database entry mode.
- F9 - Insert or Delete a field in the database or Replace the
- contents ofa field.
- F10 - Prints the contents of the current record to a printer or file.
-
- PERFORMING AN ANALYSIS
-
- Once you have entered your data into the database, you are ready to
- perform one or more analyses. Exit the data entry mode by pressing the
- F7 - Exit key. You will return to the Kwikstat main menu. All of the
- Kwikstat analysis procedures are listed in the Analyze menu. With the
- Kwikstat main menu displayed, you can press the right or left arrow
- key to pull-down the Analyze menu.
-
- For the MYDATA database you have just created, you will calculate some
- summary statistics and display a graph. The sections below lead you
- through these procedures.
-
- CALCULATING SUMMARY STATISTICS
-
- From the Analyze menu on the main Kwikstat menu screen, choose the
- Descriptive Statistics and Graphs option. A new menu will appear
- containing the options for the Descriptive Statistics and Graphs
- program module.
-
- The Descriptive Statistics and Graphs menu, lists the statistics and
- graphs you can produce from information in the current database. For
- example, suppose you want to calculate summary statistics for all of
- the numeric variables in your database. To do this, select the option
- called Summary Statistics on a number of variables. A screen will
- appear prompting you to specify what fields to use in the
- calculations. A list of the variables in the database appears. Enter
- the field numbers for each variable you want included in the analysis,
- separating each variable number with a comma. For example, if you want
- to choose variables 2 (AGE), 4 (WT) and 5 (SCORE), enter
-
- 2,4,5
-
- at the prompt, as shown at the Enter: prompt in figure 2.15 and press
- Enter.
-
- Once you select the variables to use, you are asked if there is a
- grouping variable.
-
- --------------------------------------------------------------------
- Become a Registered User, Print Order Form
-
-
- 14
-
-
-
- KWIKSTAT CONDENSED MANUAL Version 3.3
- --------------------------------------------------------------------
-
-
- Enter a grouping variable by number or Enter for none:
-
- A grouping variable allows you to calculate summary statistics by
- group, such as SEX. You could enter a 3 at this prompt to specify that
- you want the summary statistics broken down by SEX, but for this
- example, simply press Enter to specify no grouping variable. The next
- question asks you if you want to
-
- Choose (C)ontinue (F)ile (P)rint or (R)eturn to menu:
-
- To continue with the calculation and display the results on the
- screen, just press Enter. When you do, the summary statistics for the
- selected variables will appear on the screen. Once you have examined
- these results, press Enter to return to the Descriptive Statistics and
- Graphs menu.
-
- Example 2
- Using a Pre-defined Structure
-
- This example shows how you would perform an independent group t-test
- in Kwikstat using one of the pre-defined database structures. The
- example uses the spreadsheet entry type. The data used in this example
- is the same as in example 4.7 in the manual (page 4-23 and following.)
- In this example, 13 plants were randomly allocated to two groups.
- Group one received the present fertilizer and group 2 received a newer
- fertilizer. After a period of time, you observed the heights of the
- plants were observed. The results are:
-
- Data for independent group t-test (fertilizer study)
-
- Present Newer
- Fertilizer Fertilizer
- 46.2 cm 51.3 cm
- 55.6 52.4
- 53.3 54.6
- 44.8 52.2
- 55.4 64.3
- 56.0 55.0
- 48.9
-
- In order to enter this into a database, you must assign group numbers
- (or letters) to each group. For example, we will call the "Present
- Fertilizer" group 1 and the "Newer Fertilizer" group 2.
-
-
-
-
- --------------------------------------------------------------------
- Become a Registered User, Print Order Form
-
-
- 15
-
-
-
- KWIKSTAT CONDENSED MANUAL Version 3.3
- --------------------------------------------------------------------
-
-
- CREATING THE DATABASE AND ENTERING DATA
-
- Since the observations are independent, the database will include
- thirteen records (one for each plant) and two fields (one for the
- response and one for the group indicator).
-
- Choose to create a database named TTEST. A screen with the instruction
- "Choose the database type to create from the menu below" will appear.
- Since you are performing an independent group t-test, you can select
- the option titled For Independent Group t-Test or ANOVA from this
- list. This process automatically builds a database structure suitable
- for entering data for this kind of analysis. In this case, the
- database will contain a grouping field (where you will enter a 1 or 2,
- the fertilizer type) and an observation field (where you will enter
- the height.)
-
- Once you have selected a database type, you will be asked if you want
- to enter records now. Answer Yes.
-
- Enter the data into the database. The data you will enter in the first
- record is 1 (press Enter) and 46.2 (press Enter). When you type the
- 46.2 and press Enter, your cursor will automatically move to record
- number 2, where you will enter 1 and 55.6, and so on. Enter the data
- for the thirteen records. For each record of a "Present Fertilizer"
- observation, enter "1" for the GROUP variable. For the "Newer
- Fertilizer" observations enter a "2" for the GROUP variable. The
- eighth record is 2 and 51.3. Figure 2.21 shows a screen where all 13
- records have been entered, and the program is waiting for a 14th
- record to be entered.
-
- Since there is no 14th record, press the F7 function key (Exit) to end
- the data entry process. You will be prompted with the question, Do you
- want to save the record number 14?" You do not want to save this blank
- record, so answer N (No). KWIKSTAT will return to the Data main menu.
-
- PERFORMING THE ANALYSIS
-
- Once you have entered the data into a database, and you are back at
- the main menu, select the Analyze option at the top of the main
- Kwikstat menu screen. When the Analyze pull-down menu appears, select
- the t-tests and Analysis of Variance (ANOVA) option.
-
- When you choose the t-tests and Analysis of Variance (ANOVA) from the
- Analyze menu, the menu shown in figure 2.23 will appear. Select
- Compare independent groups (t-test, ANOVA). You will be prompted to
- choose the field name of the group, which in this case is simply
- GROUP. Choose GROUP. Next, you will be asked for the data field.
- --------------------------------------------------------------------
- Become a Registered User, Print Order Form
-
-
- 16
-
-
-
- KWIKSTAT CONDENSED MANUAL Version 3.3
- --------------------------------------------------------------------
-
-
- Choose OBS (HEIGHT), the response variable. KWIKSTAT will now perform
- the calculations and display the results on the screen.
-
- Refer to the section "Using t-tests and ANOVA Procedures" in Part IV
- of this manual for information on interpreting this information. When
- the results screen is displayed, typing G will cause a graphical
- comparison of the two samples. First, a screen containing Tukey's five
- number summaries (listing the 0, 25, 50, 75, and 100 percentiles for
- each group) appears. Press Enter, and box plots for each group will
- appear as shown. Press Esc to exit from the box plots. You will be
- given the option to print a report for this analysis. If you choose
- this option, a summary of the analysis will be printed to the printer
- or to a file. After you have printed the report (or chosen not to
- print the report), you will return to the module menu. To return to
- the main Kwikstat menu, press Esc. To end the program and return to
- DOS, choose the F option, Quit to DOS.
-
- ENTERING DATA INTO THE DATABASE
-
- When you choose the Data entry option from the DATA menu, you will be
- asked to specify entry from the keyboard or from a file (ASCII file).
- For most small data sets, you will probably enter data from the
- keyboard. If another program supports ASCII, dBASE and 1-2-3 type
- files, you may be able to enter data from that program in to KWIKSTAT.
- The following information describes how to enter data from the
- keyboard, from an ASCII file or from other programs.
-
- ENTERING DATA FROM THE KEYBOARD
-
- If you choose KEYBOARD data entry, an entry screen will appear
- containing the fields you created in the CREATE option. This entry
- screen will either use the database or spreadsheet format mode,
- depending on which one you specified when you chose setup options.
- However, you can easily toggle from one entry mode to another by
- pressing the F8 key. Examples 1 and 2 above describe these two entry
- methods.
-
- ENTERING DATA FROM AN ASCII FILE
-
- When you choose to enter data from an ASCII file, you will be asked
- the name of the raw data file. (i.e., \MYDIR\MYDATA.DAT). The data
- from the ASCII file will be entered into the database, and a count of
- the records as they are entered will be displayed. If there are
- already records in the file, the new data from the ASCII file will be
- appended (added) as new records to the database. It is a good idea to
- go to the List procedure to look at the data to verify that it has
- been entered correctly, or print the data out using the report option
- --------------------------------------------------------------------
- Become a Registered User, Print Order Form
-
-
- 17
-
-
-
- KWIKSTAT CONDENSED MANUAL Version 3.3
- --------------------------------------------------------------------
-
-
- (See Using KWIKSTAT Utilities). If the data does not match the fields,
- refigure the widths of each field to make sure it matches the columns
- of data in the disk file, and try again.
-
- KWIKSTAT can read data from standard ASCII text files. These kinds of
- files are usually supported by most word processing programs (such as
- WordPerfect DOS Text Mode) as well as most text editors such as EDLIN.
- Data must be in the form of column data, like this...
-
- A 22 3.3 WF
- A 33 4.2 BF
- B 27 3.3 WM
- :
- ETC
-
- Notice that each column of data is in fixed fields. It does not matter
- that there is no space between the last two fields (Race and Sex)
- since the program will pick off the information from the column and
- does not require that there be spaces between the columns. Use the
- instructions below to prepare the KWIKSTAT (dBASE) database structure
- to be used to read in ASCII data.
-
- The steps to enter ASCII data into KWIKSTAT are:
-
- STEP 1. Use the CREATE option to create a database structure to match the
- columns in the ASCII file. The field widths MUST match the width of the
- columns of data on file. If there are spaces between columns of data,
- make widths wide enough to account for those spaces. The following data
- is from the file EX.DAT on disk:
-
- A 12 22.3 25.3 28.2 30.6 5
- A 11 22.8 27.5 33.3 35.8 5
- B 12 22.8 30.0 32.8 31.0 4
- A 12 18.5 26.0 29.0 27.9 5
- B 9 19.5 25.0 25.3 26.6 5
- :
- etc
- :
- B 12 22.4 27.2 31.8 35.6 4
-
- Try your hand at doing this example by creating a database named EX with
- the following structure:
-
-
-
-
-
- --------------------------------------------------------------------
- Become a Registered User, Print Order Form
-
-
- 18
-
-
-
- KWIKSTAT CONDENSED MANUAL Version 3.3
- --------------------------------------------------------------------
-
-
- FIELD NAME TYPE WIDTH DECIMALS
- ----------- ------ ------- --------
- GROUP C 2
- AGE N 4 0
- TIME1 N 5 1
- TIME2 N 5 1
- TIME3 N 5 1
- TIME4 N 5 1
- STATUS N 2
-
- Notice that even though the first column has data 1 column wide, this
- structure uses a width of 2 for GROUP. Even though the age only uses 2
- columns, the structure calls for AGE to have a width of 4. These
- widths are enter this way to take care of the blank spaces between the
- columns. If GROUP had been set up with only 1 column and AGE with only
- 2 columns, the ASCII data would not be read into the database
- correctly. Create the database called EX with the specifications
- listed above, then go to the next step.
-
- NOTE: KWIKSTAT can also produce an ASCII text file, so that data
- created in KWIKSTAT can be output, and read into other programs.
-
- STEP 3: To verify that the data was read properly, use List option on
- the DATA menu to examine the resulting database.
-
- ENTERING DATA FROM ANOTHER PROGRAM
-
- KWIKSTAT was designed to read dBASE, comma delimited ASCII and 1-2-3
- files because these are among the most commonly used types of files to
- store data. See information on the utility module later in this
- manual.
-
- EDITING, DELETING AND PACKING DATA
-
- Once a database is created, you often need to correct information by
- editing records or getting rid of records. The following sections
- describe the process of changing the contents of a record by editing and
- a procedure for getting rid of records by deleting and packing.
-
- EDITING RECORDS
-
- If there is a need to change data already in a database, you may choose
- the Edit a record option from the DATA menu. You will be asked to specify
- the record number you wish to edit. Editing is similar to entering data.
- Use the up and down arrow keys to move from field to field within a
- record. See the Examples above for a tutorial on editing information in
- a database.
- --------------------------------------------------------------------
- Become a Registered User, Print Order Form
-
-
- 19
-
-
-
- KWIKSTAT CONDENSED MANUAL Version 3.3
- --------------------------------------------------------------------
-
-
-
- DELETING RECORDS
-
- If you want to delete an entire record within a database, use the edit
- procedure to display the record to delete.
-
- You can use one of two methods to delete a record:
-
- 1. Erase the record (Spreadsheet mode only) or
- 2. Delete and Pack
-
- To erase a record, display the record in Edit mode, using the spreadsheet
- entry mode. Highlight the record to erase, and press the function key F4.
- If you want to delete more than a few records, it will probably be more
- efficient to use the Delete and Pack method. While a record is displayed
- (either in spreadsheet or database edit mode), press ^U to mark the
- record for deletion. A **DEL** will appear on the screen (upper right
- corner) of a "deleted" record (database mode) or a "*" will appear next
- to the field name (spreadsheet mode). You can mark as many records as you
- choose. If you accidentally mark a record for delete, pressing ^U a
- second time will cancel the mark, and the **DEL** will disappear from the
- screen. Once you have marked the records for delete, pack the database,
- as described below.
-
- TIP:If you want to temporarily "get rid" of a few records so that they
- will not be used in an analysis, mark them for delete. Any analysis you
- perform will ignore deleted records. Then, if you want to restore them,
- unmark them again. This is a quick way to see how an analysis result
- would change if some selected records were not present in the analysis.
-
- PACKING THE DATABASE
-
- The records marked for delete are not actually deleted from the file at
- this point. However, they will be ignored in most analyses, and will
- continue to be displayed when you edit the database. You can undelete
- records from the Edit mode.
-
- If you want to permanently get rid of the records you have marked for
- delete, choose the Pack procedure from the Data menu. This procedure
- erases all "deleted" records from the database.
-
-
-
-
-
-
-
- --------------------------------------------------------------------
- Become a Registered User, Print Order Form
-
-
- 20
-
-
-
- KWIKSTAT CONDENSED MANUAL Version 3.3
- --------------------------------------------------------------------
-
-
- MODIFYING AND DISPLAYING THE STRUCTURE
-
- The Modify or Display database structure option on the DATA menu allows
- you to display the structure of your database, and allows you to change
- characteristics about the database structure. When you choose to display
- the structure, a list of all field names, their types, widths and
- decimals (if any) are listed.
-
- When you choose to modify a field, you age given a chance to the modify
- the characteristics of that field. Your options are:
-
- Delete the Field
- Change Name of Field
- Change Type of Field
- Change Width of Field
- Change Number of Decimal Places
-
- If you change the type of field, say from character to numeric, the
- program will attempt to convert the contents of the field to its new
- type. When you modify a database, you will be asked to enter the name of
- a new database. This means that the modified database will be in a new
- file, and your old original database will remain intact. If you no longer
- want the old database, you must delete it by choosing the Kill option
- from the Data menu.
-
- SETTING MISSING VALUES CODES
-
- Sometimes in the collection of data there are values that are lost or
- cannot be gathered. These are called "missing values". When such values
- occur, it is important for the program to know that the values are
- missing so that statistical calculations may take this into account.
- Missing values are usually designated as an impossible value. For
- example, the missing values designated for the variable AGE may be -9,
- since it is impossible for the variable AGE to have the value -9. When
- the program is asked to calculate the mean of age, for example, it will
- ignore those records where AGE is -9 in that calculation if -9 has been
- specified as the missing value code. In most KWIKSTAT procedures, there
- is a casewise deletion of the record from calculation whenever a missing
- value is encountered.
-
- Once you designate a missing value code for a variable, it is up to you
- to make sure that this code gets placed into your database in the proper
- records and fields. For example, if you have designated -9 as the
- missing value code for AGE, you must make sure that in your database a -9
- appears in the field AGE if that data is missing or unknown.
- A standard dBASE III file does not have a way to designate missing
- values, but KWIKSTAT allows a way for you to designate these values in
- --------------------------------------------------------------------
- Become a Registered User, Print Order Form
-
-
- 21
-
-
-
- KWIKSTAT CONDENSED MANUAL Version 3.3
- --------------------------------------------------------------------
-
-
- this program. The Indicate missing value codes option on the DATA menu is
- used to set up these values. When this option is selected, the program
- will display an entry screen that is similar to a data entry screen. You
- may enter one missing value for each field name. The missing value must
- obey the definition of the field in terms of length and type.
- Once missing values are entered, they are stored on disk in a file named
- filename.MV, where "filename" is the name of the designated database. If
- a new variable is created using the transformation procedure, its missing
- value is appended to the missing value file.
-
- You may change or correct the missing values for a database at any time
- by calling up this option. If missing values are already designated for
- the database, they will be displayed on the entry screen, and you may
- edit them or accept them as they are.
-
- IMPORTANT NOTE: If missing values are NOT used, and there is a blank
- numeric variable in a calculation, it will be treated like the value 0
- (zero), so it is important to use missing values if your data contains
- such entries. Otherwise, the statistical calculations will be in error!!
-
- CREATING A NEW FIELD
-
- In previous versions of Kwikstat, to create a new variable, you chose
- the Transformation option. This option only allowed the creation of
- numeric variables. This option have been replaced by a procedure in
- the Edit mode, which allows you to create new blank fields of any
- field type, and to place information in those fields that is either a
- numeric or character expression. Thus, this procedure replaces the old
- Transformation procedure. The sections "Create a New Field" and
- "Replacing the Contents of a Field" describe these procedures. You may
- create a new field in a database within an edit screen by choosing the
- F9 (FIELD Insert) option. After creating a new field, you can then use
- the F9 (FIELD Replace) option to place a value in the new field.
-
- When you choose the Field option in the edit screen (F9), you will be
- prompted to enter information about the new field.
-
- Define a name for the new field
- Define the field type
- Define a width for the new field
-
- For numeric variables,
- Define the number of decimals, if any
- Define a missing value code. If none is selected, it is assumed to be 0
- (zero).
-
- CAREFUL ATTENTION must be paid to the definition to assure that the
- --------------------------------------------------------------------
- Become a Registered User, Print Order Form
-
-
- 22
-
-
-
- KWIKSTAT CONDENSED MANUAL Version 3.3
- --------------------------------------------------------------------
-
-
- calculated numbers will fit into the field width specifications. If the
- calculated number is too large to fit into the field, it will be given
- the missing value code. If an illegal calculation is attempted, such as a
- division by 0, the result will be missing. If a calculation includes a
- missing value, the result will be a missing values.
-
- TIP: To create a new field containing a new value that is a numeric
- transformation of other fields, first insert the new field using the F9
- Field/Insert option, then use the F9 Field /Replace option to place the
- value in the new field.
-
- REPLACING CONTENTS OF A FIELD
-
- You can use the F9-Field Replace option in the Edit screen to replace the
- existing contents of a field, or place new information in a newly created
- blank field. Kwikstat provides a number of numeric and character
- functions to enable you to do this.
-
- For example, if you wanted to replace the contents of the field RATIO
- with the values WEIGHT/HEIGHT:
-
- 1) In the edit mode, highlight the field whose contents you want to
- replace. Press the F9 (Field) option, and choose Replace the contents of
- a field option from the Field menu. A dialog box will appear.
-
- 2) Specify which records to replace. The default it ALL, which means all
- records in the database. Or, enter a range such as 1-20, which would mean
- only perform the replacement in records number 1 through 20. Then, press
- Enter.
-
- 3) Specify what to place in the field. For example, enter the formula
- WEIGHT/HEIGHT in the Replace With entry field, where WEIGHT and HEIGHT
- are two other fields in the same database.
-
- 4) Specify any condition for replacing, if any. The default is NONE. For
- example, if you only want the replacement to be for records whose value
- of AGE is greater than 20, you would enter the expression AGE20 in the
- condition entry field.
-
- 5) Press F7 when you have finished entering the Replace information, and
- the replace will begin. When it is finished, you will return to the edit
- screen. An example of the Replace dialog box is shown in figure
- 2.28.
-
- The kinds of expressions you can use the Replace With and Condition
- fields are described below.
-
- --------------------------------------------------------------------
- Become a Registered User, Print Order Form
-
-
- 23
-
-
-
- KWIKSTAT CONDENSED MANUAL Version 3.3
- --------------------------------------------------------------------
-
-
- KWIKSTAT supports two kinds of expressions. One is strictly for
- mathematical expressions, called a math expression. Thhe other expression
- type, called a database expression, allows the use of character, numeric,
- date and logical fields in the expression. Here are the criteria for when
- these are used:
-
- REPLACE WITH FIELD: Use either a math expression or a database
- expression.
-
- CONDITION FIELD: Use only a database expression.
-
- In the REPLACE WITH field, the default expression type is the database
- type. In order for an expression to be evaluated as a strictly math
- expression, you must place an equal sign "=" at the beginning of the
- expression.
-
- The major difference between the two expression types are in their
- capabilities. The database expression can handle most common
- calculations, including simple math, string evaluation, and date
- evaluation. The math expression can be used only for strictly numeric
- calculations using one or more of the functions listed in the table
- below, or that uses the exponentiation operator.
-
- For example, if you want to perform the calculation WEIGHT/HEIGHT, you
- can enter the expression as-is in the REPLACE WITH field. However, if you
- want to calculate the log of WEIGHT/HEIGHT, you must enter the experssion
- as
-
- =LOG(WEIGHT/HEIGHT)
-
- since the LOG function is not supported as a database expression
- function. The equal sign signals to the program to use the math
- calculator. The information below outlines the capabilities of both
- expression types.
-
- Mathematical operators supported are Add +, Subtract -, Divide
- /, Multiply *, and Exponentiation ^ (Math calculator only). For
- Character fields, the database calculator supports the operation:
- Add + (appends one string to another).
-
- Following are a few examples of correct expressions:
-
- AGE/HEIGHT
- =SCORE^2 (= signals math calculator)
- LTRIM(FIRST)+' '+LAST
-
- Note: Literal strings included in expressions must be surrounded by
- --------------------------------------------------------------------
- Become a Registered User, Print Order Form
-
-
- 24
-
-
-
- KWIKSTAT CONDENSED MANUAL Version 3.3
- --------------------------------------------------------------------
-
-
- single quotes. For example, 'Hello' is a literal string. Character
- field names are used without quotes. For example, NAME is a field
- name. A correct string expression using these two strings would be:
- 'Hello '+NAME TIP:Unless you use scientific functions in your
- calculations, you don't need to be concerned about which calculation
- type to use. Only if you use a numeric operation or function not
- supported by the database calculator will you need to place an equal
- (=) sign at the first of the expression.
-
- Database calculator functions supported The following functions may be
- used in expressions both in the Replace With and Condition
- fields.
-
- Database Calculator Functions
-
- ABS(NUM), ASC(STG), AT(STG1,STG2), CALENDAR(NUM), CAPS(STG), CHR(NUM),
- DATE(), DELETED(), IIF(LEXP,AEXP1,AEXP2), INT(NUM), JULIAN(DATE),
- LEFT(STG,NUM), LEN(STG), LOWER(STG), LTRIM(STG), MAX(NUM1,NUM2),
- MIN(NUM1,NUM2), RECNO(), REPLICATE(STG,NUM), RIGHT(STG,NUM),
- RTRIM(STG), SPACE(NUM), STR(NUM), STRING(NUM,NUM|STR), RIGHT(STG,NUM),
- STUFF(STG,NUM,NUM,STG2), SUBSTR(STG,NUM,[NUM]), TIME(), TRIM(STG),
- UPPER(STG), VAL(STG),
-
- Following a few example uses of these functions:
-
- ASC - Converts the first character of a string to its ASCII code. For
- example, the function ASC('A') would return the value 65, since 65 is
- the code for an uppercase A.
-
- AT - Returns the starting position of one character string within
- another character string. For example, the expression AT('Bill', 'Wild
- Bill') = 5 since the string 'Bill' begins five characters deep in the
- string 'Wild Bill'.
-
- CHR - Converts a number into its ASCII value. For example, CHR(65) is
- equal to the character string 'A'.
-
- DELETED - Returns a T if the current record is marked for delete, else
- it returns a F. Can be used to conditionally replace a value depending
- on if the record is deleted or not.
-
- INT - Rounds down to nearest integer. INT(3.2) would be returned as 3.
-
- LEFT and RIGHT - Returns the left or right portion of a string. For
- example, LEFT('Wild Bill',3) would return the string 'Wil' and
- RIGHT('Wild Bill',3) would return the string 'ill'.
-
- --------------------------------------------------------------------
- Become a Registered User, Print Order Form
-
-
- 25
-
-
-
- KWIKSTAT CONDENSED MANUAL Version 3.3
- --------------------------------------------------------------------
-
-
- LOWER and UPPER - Returns lower or upper case string. For example,
- LOWER('Wild Bill') would return 'WILD BILL'.
-
- LTRIM, RTRIM and TRIM - Trims blanks from right, left or both ends of
- a string. For example, LTRIM('Wild Bill ') would return 'Wild
- Bill'.
-
- VAL - Returns the value of a string. For example VAL('24') is the number
- 24.
-
- Most of these functions are similar to or identical to functions used in
- the BASIC language or in dBASE or other database programs. For more
- examples, you might refer to documentation on these programs.
- MATH EXPRESSIONS
-
- The following functions are supported only in the Replace With entry
- field, and only for numeric field types. You MUST preceed expressions
- using these functions with an = sign.
-
- An example of the RECODE function, which appears on the following table
- is:
-
- =RECODE(SCORE,1,AGE,10,15)
-
- The five arguments in the RECODE function are:
-
- No. Example Meaning
- 1 SCORE Field to use in compare
- 2 1 Value to assign if comparison is true
- 3 AGE Value to assign if comparison is false
- 4 10 Low range of field to compare
- 5 15 High range of fields to compare
-
- Thus, this example means that the value of the RECODE is 1 if SCORE is
- between 10 and 15, else the value is the current value of the AGE field
- for that record.
-
- Math Calculator Functions
-
- ABS(NUM), ASIN(NUM), ATAN(NUM), ATAN2(y,x), CSC(NUM), COS(NUM),
- COT(NUM), EXP(NUM), INT(NUM), LN(NUM), LOG(NUM), MAX(1,T2,3)
- MIN(1,T2,T3), MOD(NUM1,NUM2), PI, RAND, RECNO,
- RECODE(NUM1,NUM2,NUM3,NUM4,NUM5,NUM6), ROUND(NUM,DEC), SEC(NUM),
- SIN(NUM), SQRT(NUM), SUM(NUM1,NUM2...), TAN(NUM),
-
-
-
- --------------------------------------------------------------------
- Become a Registered User, Print Order Form
-
-
- 26
-
-
-
- KWIKSTAT CONDENSED MANUAL Version 3.3
- --------------------------------------------------------------------
-
-
- SUBSETTING THE DATABASE
-
- The Subset database option on the DATA menu allows you to create a new
- database from an old database. The new database can be a subset of the
- old one, using a conditional criteria for outputting information from the
- old database to the new one.
-
- For example, suppose you have a database with a field GROUP with values
- 1, 2, 3, 4 and 5. You want to create a database that does NOT include
- Group 5. After choosing Subset database from the DATA menu, you are
- asked for the name of the new database. For example, your new database
- might be named NO5.DBF. You are asked for the field name to be used in
- the selection criteria. In this case, you would choose the field named
- GROUP. Next you must enter the selection relationship. It will be
- described as a numerical expression. The conditional operators you may
- use are:
-
- = > < >= <= <> =
-
- and the logical operators .NOT., .AND., and .OR..
-
- It is important that a dot (.) appear before and after each
- logical operator. For example, you might enter a condition such as
-
- AGE <10 .OR. SEX='M'
-
- When you choose the Subset option from the Data menu, a Subset dialog
- box appears on the screen.
-
- There are two items you must enter in the Subset dialog box. First is a
- name for the new database. This must not be the same name as the current
- database. Then, you must enter the subset criteria. Example of subsetting
- criteria are:
-
- GROUP = 4
- GROUP> STATUS
- GROUP < WEIGHT*HEIGHT
- TIME1 = TIME2*1.96
- SEX = 'F'
- TIME1 <=20 .AND. SEX = 'M'
-
- When creating these expressions, you can use the same functions as was
- previously described in the table "Functions Supported for Character,
- Date and Value Expressions."
-
-
-
- --------------------------------------------------------------------
- Become a Registered User, Print Order Form
-
-
- 27
-
-
-
- KWIKSTAT CONDENSED MANUAL Version 3.3
- --------------------------------------------------------------------
-
-
- LISTING THE DATABASE TO THE SCREEN
-
- The LIST option on the DATA menu allows you to look at the information in
- your database. The list produces an on-screen report that lists the data
- one record at a time. If your database contains too many fields to be
- displayed on the screen at one time, the list procedure will ask you at
- which field to begin the display.
-
- ZAP A DATABASE
-
- The Zap option allows you to quickly erase all records from a
- database. To use this option, open a database, then choose Zap.
-
- KILL - DELETE A DATABASE
-
- The Kill option allows you to delete a database and its related
- missing values files (if any.) When you choose this option, a list of
- databases will appear on the screen. Choose the database to delete,
- and the file(s) will be erased from your disk.
-
- QUIT/EXIT KWIKSTAT
-
- Use this option to end the Kwikstat program and return to DOS.
-
-
-
- PART IV
- PERFORMING A STATISTICAL ANALYSIS
-
- This section of the KWIKSTAT manual describes the statistical analysis
- procedures available in the basic KWIKSTAT program.
-
- USING DESCRIPTIVE STATISTICS AND GRAPHS
-
- The Descriptive Statistics and Graphs module allows you to examine
- summary statistics of the data in a database. Graphics are used
- throughout KWIKSTAT to provide visual displays of the data.
-
- DETAILED STATISTICS ON A SINGLE VARIABLE
-
- This option calculates the mean, standard deviation, median, standard
- error of the mean, minimum, maximum, sum, and variance of a set of
- data. In this option, KWIKSTAT also calculates five percentiles and
- computes a two-sided confidence interval about the mean. If you do not
- specify otherwise, the default percentiles (Tukey's five number
- summary: 0, 25th, 50th, 75th, 100th percentiles) and default level of
- confidence (95%) are used. If the Tukey five number summary is used, a
- --------------------------------------------------------------------
- Become a Registered User, Print Order Form
-
-
- 28
-
-
-
- KWIKSTAT CONDENSED MANUAL Version 3.3
- --------------------------------------------------------------------
-
-
- box plot is also displayed. For sample sizes less than or equal to 30,
- a t-statistic is used to calculate for confidence interval. For sample
- sizes over 30, the 95% (two-sided) z-statistic (1.96) is used.
-
- SUMMARY STATISTICS ON A NUMBER OF VARIABLES
-
- This option is similar to the above Descriptive statistics on a single
- variable, but in this option several variables can be summarized using
- descriptive statistics (sample size, mean, standard deviation, minimum,
- maximum, and standard error of the mean).
-
- If you have a grouping variable in your database, you may request output
- of summary statistics by group. You are also given the opportunity to
- print results to the printer, or to output results to a file.
-
- APPROXIMATE P-VALUE DETERMINATION
-
- This option calculates p-values for entered values of four test
- statistics: normal (z), student's t, F, chi-square. If you designate the
- statistic being used, degrees of freedom and the calculated value of the
- test statistic, KWIKSTAT will tell you the p-value associated with that
- test statistic.
-
- PRODUCING A HISTOGRAM
-
- This procedure produces a histogram from values read from a database.
- A histogram can be helpful in determining if the distribution of a
- continuous variable is approximated by a normal distribution. If the
- histogram has a peak toward the center, with both tails diminishing,
- the data could be considered to be approximated by a normal
- distribution.
-
- PRODUCING AN XY-PLOT (SCATTERPLOT)
-
- This option enables you to produce a scatterplot of two variables. A
- scatterplot is simply a plot of all the data values plotted one variable
- against the other. Such a plot is helpful in determining if two variables
- are related, and if the relationship is linear (a straight line),
- curvilinear, or something else. This information is important for
- regression and correlation. (Scatterplots can also be produced from the
- Regression & Correlation module.)
-
-
-
-
-
-
- --------------------------------------------------------------------
- Become a Registered User, Print Order Form
-
-
- 29
-
-
-
- KWIKSTAT CONDENSED MANUAL Version 3.3
- --------------------------------------------------------------------
-
-
- EXAMPLE 4.5: SCATTERPLOT
-
- This example uses the EXAMPLE database file on the KWIKSTAT disk.
- Suppose you want to create and display a scatterplot of the TIME1
- variable against TIME2. First, you must retrieve the database:
-
- RETRIEVING THE DATABASE
-
- In the lower left corner of the screen you can see the name of the
- database currently in use. If it's the one you want, EXAMPLE in this
- case, go on to Performing the Analysis. If another database is in use,
- see Example 4.2 for detailed instructions on retrieving EXAMPLE.
-
- PERFORMING THE ANALYSIS
-
- From the Descriptive Statistics and Graphs menu, select XY-Plot
- (Scatterplot). You will be prompted to enter two fields, or variables,
- to use. Since you want to do a scatterplot of TIME1 against TIME2,
- enter 3,4. You will then be prompted for specifications for the plot.
- You may use the default settings (by simply pressing Enter at each
- prompt) or you may set your own. (The default setting is no grid lines
- and points not connected.) KWIKSTAT will draw the scatterplot
- according to the specifications. The disappearing menu at the bottom
- of the screen gives you the option to (P)rint. (Press Enter to make
- the bottom menu reappear.) You can use the (R)eplot option to play
- around with the specifications of the scatterplot. Pressing Esc takes
- you back to the Choose Analysis Option menu.
-
- Figure 4.4 shows the scatterplot of TIME1 and TIME2 of the EXAMPLE
- database using default specifications. This plot suggests that the two
- variables, TIME1 and TIME2 are related positively. As TIME1 increases,
- so does TIME 2.
-
- TIME SERIES PLOT
-
- This option enables you to produce a time-series plot for one variable.
- This plot is useful in examining data that is time related, such as
- profit by month, etc. The X axis is assumed to be "time". The data
- values must be entered into records in chronological order the
- observations occurred, i.e., the first record must contain the results
- of the first observation (first time period), etc.
-
-
-
-
-
-
- --------------------------------------------------------------------
- Become a Registered User, Print Order Form
-
-
- 30
-
-
-
- KWIKSTAT CONDENSED MANUAL Version 3.3
- --------------------------------------------------------------------
-
-
- USING T-TESTS AND ANOVA PROCEDURES
-
- T-tests and Analysis of Variance (ANOVA) procedures are used to test
- hypotheses about population means using data obtained through random
- sampling of those populations. For example, if you do an experiment in
- which you give different treatments to different groups (e.g., different
- fertilizers to different groups of plants), the response measurements
- (e.g., plant heights) will very likely all be different and the average
- responses for the different groups will be different.
-
- PARAMETRIC INDEPENDENT GROUP ANALYSIS
-
- Independent group analysis is appropriate when observations are taken
- from groups in which subjects in one group do not appear in another
- group. That is, the observations within as well as between groups are
- independent of one another. In this module, a t-test is performed when
- there are two groups, and an ANOVA is performed when there are three to
- ten groups being compared. When performing a t-test or ANOVA on two or
- more independent groups, you are testing the hypotheses:
-
- Ho: The difference in the means of the groups is zero.
- Ha: The difference in the means of the groups is not zero.
-
- For a two-sample t-test, two t-statistics are calculated, one for the
- case in which the variances of the two samples are equal and the other
- for use in the case of unequal variances. KWIKSTAT performs a test
- of the hypothesis that the variances are equal. If the p-value is
- small (e.g., less than 0.05), the hypothesis of equal variances is
- rejected and you use the t-statistic for unequal variances. If the
- p-value is large, use the t-statistic for equal variances. Since the
- observations are all independent of one another, each observation is
- entered as an individual record in the database. The number of data
- records must be the same as the total number of observations. Each
- record includes the response value of one observation and a number or
- character to indicate to which treatment group it belongs. That is,
- there will be two fields (variables), one in which to record the
- response and one in which to indicate the group.
-
- EXAMPLE 4.8. SINGLE FACTOR ANOVA
-
- When more than two independent groups are compared with respect to one
- variable, one-way or single factor analysis of variance techniques are
- appropriate.This example uses data for hogs which have been randomly
- assigned to four groups, with each group being given a different feed.
- The response is weight gain.
-
- Data for Independent Group ANOVA
- --------------------------------------------------------------------
- Become a Registered User, Print Order Form
-
-
- 31
-
-
-
- KWIKSTAT CONDENSED MANUAL Version 3.3
- --------------------------------------------------------------------
-
-
-
- Gp 1 Gp 2 Gp 3 Gp 4
- 60.8 78.7 92.6 86.9
- 67.0 77.7 84.1 82.2
- 54.6 76.3 90.5 83.7
- 61.7 79.8 90.3
-
- CREATING A DATABASE
-
- The database to analyze this data is similar to the one used for
- Example 4.7 above, differing only with respect to the number of
- groups. In fact, this one-way ANOVA is an extension of the t-test when
- there are three or more groups.
-
- Create a database (named e.g., HOGFEED) with two fields: GROUP (or you
- may want to call this field FEED) and WEIGHT. The groups will be
- numbered 1,2,3,4 according to the type of feed used. The contents of a
- database option, the HOGFEED database should look like this:
-
- RECNO GROUP WEIGHT
- 1 1 60.8
- 2 1 67.0
- 3 1 54.6
- 4 1 61.7
- 5 2 78.7
- 6 2 77.7
- 7 2 76.3
- 8 2 79.8
- 9 3 92.6
- 10 3 84.1
- 11 3 90.5
- 12 4 86.9
- 13 4 82.2
- 14 4 83.7
- 15 4 90.3
-
- PERFORMING THE ANALYSIS
-
- Select the Analyze menu. When the Analyze pull-down menu appears,
- choose the t-tests and Analysis of Variance (ANOVA) option. The
- t-tests and Analysis of Variance Choose Analysis Option menu will
- appear. Select Compare independent groups (t-test, ANOVA).
-
- You will be prompted to choose the field name of the grouping variable,
- which in this case is GROUP. Choose GROUP. Next, you will be asked for
- the data field. Choose WEIGHT, the response variable. KWIKSTAT will
- now perform the calculations and display the results on the screen, as
- --------------------------------------------------------------------
- Become a Registered User, Print Order Form
-
-
- 32
-
-
-
- KWIKSTAT CONDENSED MANUAL Version 3.3
- --------------------------------------------------------------------
-
-
- illustrated in Figure 4.7.
-
- The results of this test are summarized in the p-value. In this case,
- the small p-value (0.000) means that there is a significant difference
- between groups. The p-value (rounded off) is less than 0.0005. That
- is, the difference between these observed averages is so far from zero
- that the chance of getting differences farther from zero is less than
- five in 10,000 if the true means are equal (their difference is zero).
- This is taken as evidence of a "real" difference between feeds, a
- difference not due to chance. A p-value of this magnitude is often
- reported as p <.01 in research literature.
-
- The ANOVA tells you only that there is a difference among the feeds.In
- order to find out which groups are significantly different from which
- others, press M to choose (M)ultiple comparison. The Newman-Keuls
- multiple comparison test will describe which of the means are
- significantly different from which others (at the 0.05 significance
- level).
-
- Figure 4.8 displays a graphical representation of the Newman-Keuls
- multiple comparisons test. The group numbers are given in increasing
- order of the value of their group means. That is, Group 1 has the
- smallest mean, Group 3 the largest. At the 0.05 significance level, the
- means of any two groups underscored by the same line are not
- significantly different. This display tells you that (at the 0.05
- significance level):
-
- 1) The mean for group 1 (feed 1) is statistically significantly less than
- the means for all other groups.
-
- 2) The mean for group 2 (feed 2) is significantly greater than the mean
- for group 1, and significantly less than the means of groups 4 and 3.
-
- 3) The means for groups 4 and 3 are not significantly different from each
- other, but they are both significantly greater than the means of groups 1
- and 2.
-
- You can conclude that feeds 3 and 4 are better than feeds 1 and 2, but
- there is not enough evidence to say that either feed 3 or 4 is the
- best overall.
-
- Box plots are also available to graphically illustrate the differences
- between the groups. Type G (for graphical comparison) and press Enter
- to produce the plots.
-
-
-
- --------------------------------------------------------------------
- Become a Registered User, Print Order Form
-
-
- 33
-
-
-
- KWIKSTAT CONDENSED MANUAL Version 3.3
- --------------------------------------------------------------------
-
-
- PARAMETRIC REPEATED MEASURES (PAIRED) ANALYSIS
-
- Repeated measures are observations taken on the same or related
- subjects over time or in differing circumstances. Examples would be
- weight loss, or reaction to a drug across time. Repeated measures may
- also be matched subjects.
-
- In this module, as in the independent groups module, a t-test is
- performed when there are two groups (two repeated measures), and an
- analysis of variance is performed by KWIKSTAT if there are three to
- ten groups. The ANOVA determines if there is a difference in the means
- across groups or repeated measures. A multiple comparison procedure
- further identifies where the differences lie.
-
- In a database for paired or repeated measures data, each record
- represents one subject (e.g., person, animal). There must be one field
- for each repeated measure (each treatment group). For paired data,
- there are two groups, hence two fields. Thus, in each record, there is
- a field in which to enter data from each observation (treatment) on
- that subject. This repeated measures (paired) analysis requires that
- all values be available for each subject and any subject with missing
- values is eliminated from the analysis. That is, a data record must
- have a value for each field, or it will be eliminated.
-
- The hypotheses being tested with a paired t-test or a repeated measures
- ANOVA is:
-
- Ho: There is no difference among means of the groups (repeated measures).
- Ha: There is a difference among means of the groups.
-
- For comparing matched or paired data (not independent) from two groups, a
- paired t-test is used.
-
- EXAMPLE 4.9: PAIRED T-TEST
-
- The data in this example are before and after weights for eight persons
- on a diet. Notice that in this case, both data values are taken from the
- SAME entity (person).
-
- Data for paired t-test
-
- Person Before After
- 1 162 168
- 2 170 136
- 3 184 147
- 4 164 159
- 5 172 143
- --------------------------------------------------------------------
- Become a Registered User, Print Order Form
-
-
- 34
-
-
-
- KWIKSTAT CONDENSED MANUAL Version 3.3
- --------------------------------------------------------------------
-
-
- 6 176 161
- 7 159 143
- 8 170 145
-
- CREATING THE DATABASE
-
- The database will include two fields (BEFORE and AFTER) and eight
- records, one for each person. Since the observations are paired, not
- independent, the database reflects this by having each record contain a
- pair of observations. Each record, that is, each person, is independent
- of the over seven persons, but within a record, the before and after
- observations are not independent of each other. To create this database:
-
- Create a database called DIET containing the data list above. Use
- the pre-defined database structure called,
-
- FOR PAIRED T-TEST OR McNEMAR's TEST
-
- This will create a database with the fields VAR1 and VAR2. The VAR1 will
- be used for Before and VAR2 will be used for After. Of course, you can
- choose to create a custom database and enter a structure containing the
- fields named BEFORE and AFTER.
-
- Enter the data into the database. The data you will enter in the first
- record is 162 (press Enter) and 168 (press Enter). Enter the data for
- the eight records. When you exit the entry procedure, KWIKSTAT will
- return to the Data main menu.
-
- PERFORMING THE ANALYSIS
-
- Select the Analyze option form the main KWIKSTAT menu. Next, choose
- the t-tests and Analysis of Variance (ANOVA) option. The t-test and
- Analysis of Variance Choose Analysis Option menu will appear. Then
- choose Compare repeated or paired data (t-test, ANOVA).
-
- You will be prompted to choose the fields which you wish to compare.
- Choose BEFORE and AFTER. KWIKSTAT will now perform the calculations
- and display the results on the screen.
-
- The means and standard deviations for each group are displayed, but
- more importantly, the mean difference between BEFORE and AFTER
- measurements is given. The statistical procedures are performed on
- this average difference. A 95% confidence interval for the mean
- difference is given, as well as a calculated t-statistic and a
- p-value. These results are interpreted like those of a single sample
- t-test with null hypothesis: mean=0, and alternative hypothesis: mean
- <> 0.
- --------------------------------------------------------------------
- Become a Registered User, Print Order Form
-
-
- 35
-
-
-
- KWIKSTAT CONDENSED MANUAL Version 3.3
- --------------------------------------------------------------------
-
-
-
- The calculated t-statistic is 2.37. The test is performed with 7 degrees
- of freedom, and the p-value associated with the test is 0.008. A small
- p-value such as this is usually interpreted to indicate rejection of the
- null hypothesis and leads to the conclusion that the average difference
- in BEFORE and AFTER weights is not zero, i.e., there is evidence of a
- significant (at the 0.05 level) change of weight in these eight subjects
- on average. Since KWIKSTAT uses the difference "second minus first
- variable" (i.e., "before minus after") to compute the t-statistic, and
- since the mean difference is positive, on average before weights are
- greater than after weights which implies that the change is loss of
- weight on average.
-
- INDEPENDENT GROUP TESTS FROM SUMMARY DATA
-
- This option allows you to perform a one-way ANOVA or a t-test if you
- have only the means, standard deviations and group sizes of two to ten
- groups. Since data are summary, no box plots can be given.
-
-
- USING NON-PARAMETRIC COMPARATIVE PROCEDURES
-
- Non-parametric procedures are appropriate when the assumption of
- normality cannot be made for a small data set or when a large data set
- is known to be from a non-normal population. Non-parametric procedures
- are generally based on ranks rather than actual data values, so these
- procedures can be useful also when actual data values are not known,
- but the order or ranks of the data values are known.
-
- NON-PARAMETRIC INDEPENDENT GROUP ANALYSIS - MANN-WHITNEY AND
- KRUSKAL-WALLIS TESTS
-
- This option is appropriate if you are comparing two or more
- independent groups, but you cannot make the assumption that the
- observed data follow a normal distribution or that the variances are
- equal. It is also useful if you do not have exact data values for the
- observations but you do have order statistics, that is, you don't know
- the exact response values but you know which is largest, next largest,
- and so forth, to smallest. The samples must be randomly and
- independently taken from populations that differ only with respect to
- location, and the variable of interest should be continuous. See Zar,
- 1984.
-
- In the Non-Parametric Comparison Tests Module, KWIKSTAT uses the
- Mann-Whitney procedure if two independent groups are being compared,
- and the Kruskal-Wallis procedure if three or more groups are being
- compared. The hypotheses being tested are:
- --------------------------------------------------------------------
- Become a Registered User, Print Order Form
-
-
- 36
-
-
-
- KWIKSTAT CONDENSED MANUAL Version 3.3
- --------------------------------------------------------------------
-
-
-
- Ho: There is no difference in the medians of the groups.
- Ha: There is a difference in the medians of the groups.
-
- The Mann-Whitney and Kruskal-Wallis non-parametric procedures differ from
- the independent groups analysis described in the previous section in that
- the ranks, or order, of the data are used for the analysis rather than
- the data values themselves.
-
- FRIEDMAN'S TEST
-
- When repeated observations are taken from the same subject, and there
- is interet in comparing the observations for each repeated measure
- (e.g., each type of treatment), then a repeated measures analysis may
- be appropriate. If you cannot make the assumption that the data are
- normal, the a nonparametric analysis is appropriate. One method of
- performing such as test the Friedman's Analysis.
-
- COCHRAN'S Q TEST
-
- Cochran's Q test is a non-parametric procedure appropriate for use
- with dichotomous data when the experiment involves repeated measures
- on blocks. Often the blocks are subjects (people or animals). The
- response of the subjects to the treatments is dichotomous if it is
- taken as one of only two possible outcomes, often labeled "success"
- and "Failure", rather than as a measurment.
-
-
- USING REGRESSION & CORRELATION PROCEDURES
-
- To examine the linear relationship between variables, correlation and
- linear regression are used.
-
- Simple linear regression is used for predicting a value of a dependent
- variable using an independent variable. Multiple regression is used
- for predicting the value of a dependent using one or more independent
- variables. Correlation is used to measure the strength of association
- between two variables. For example, you may be interested in relating
- advertising to orders received. The question you are asking is, "Is
- there a relationship between the amount of money spent on advertising
- and the amount of orders received?" It is also possible to compare
- more than two variables at a time using multiple regression. For
- example, you may be interested in how the combination of radio
- advertising costs, direct mail costs and commissions relate to the
- number of orders received.
-
-
- --------------------------------------------------------------------
- Become a Registered User, Print Order Form
-
-
- 37
-
-
-
- KWIKSTAT CONDENSED MANUAL Version 3.3
- --------------------------------------------------------------------
-
-
- SIMPLE LINEAR REGRESSION ANALYSIS
-
- When you choose the Simple Linear Regression option, KWIKSTAT will
- prompt you to choose the "independent" and "dependent" variables to be
- used in the analysis. The "independent" variable is generally that
- variable that you can choose, regulate or specify (e.g., amount of
- money spent on advertising) and the "dependent" variable is the one
- you observe and would possibly like to predict. After the two
- variables are chosen, KWIKSTAT will present the results of its
- calculations.
-
- The regression equation will be displayed along with other results.
- This equation is the "least squares" line fitted to the data. If the
- fit is appropriate, the equation may be used to predict a new value of
- the dependent variable given the value of the independent variable,
- within the range of the original data.
-
- MULTIPLE REGRESSION ANALYSIS
-
- Multiple regression is an extension of simple linear regression into
- several dimensions (several independent variables). In the multiple
- regression procedure, you must enter a list of the independent
- variables and a single dependent variable on which you wish to perform
- the regression analysis. In KWIKSTAT you may use up to 10 independent
- variables in this option. Multiple regression can be complicated.
- Refer to a good text on the subject before making any conclusions
- about your results.
-
- KWIKSTAT calculates and displays several results, including the
- coefficients and intercept of the regression "line". A significance
- test is performed to determine the significance of the contribution of
- the different variables or factors to the model (mathematical
- representation).Also displayed is R-square (R2), as well as adjusted
- R-square. R-square varies from 0.0 to 1.0, with 0.0 meaning no
- relationship (model is not good) and 1.0 meaning the regression
- equation perfectly describes the sample data.
-
- An analysis of variance is performed to determine the overall
- significance of the model. If the ANOVA reveals a significant
- relationship, (that is, if the p-value is small) the model may be a
- good representation of the sample data.
-
- A plot of residuals from the fit is available. You may plot the fit
- against any of the variables. Look for patterns in the residuals.
- Patterns other than a horizontal band about zero suggest that the
- assumptions necessary for regression analysis may be violated. If you
- are unfamiliar with multiple regression, the Neter and Wasserman book
- --------------------------------------------------------------------
- Become a Registered User, Print Order Form
-
-
- 38
-
-
-
- KWIKSTAT CONDENSED MANUAL Version 3.3
- --------------------------------------------------------------------
-
-
- contains an excellent treatment.
-
- EXAMPLE 4.20: MULTIPLE REGRESSION ANALYSIS (LONGLEY DATA)
-
- Longley introduced a data set which has often been used in comparing
- multiple linear regression procedures in the literature. The variables
- refer to economic factors. This example uses the LONGLEY database on
- the KWIKSTAT disk.
-
- The LONGLEY database consists of 7 fields: DEFLATOR, GNP, UNEMP, ARMED,
- POP, TIME, and TOTAL. The first six of these will be used as independent
- variables and the seventh, TOTAL, is the dependent variable (the one to
- be predicted). Figure 4.15 displays the LONGLEY database. You can get
- this display by using the List (display) the contents of a database
- option on the Data main menu.
-
- PERFORMING THE ANALYSIS
-
- Open the LONGELY database. From the Analyze menu in the main KWIKSTAT
- module, select the Regression & Correlation module. From the
- Correlation and Regression menu select Multiple Linear Regression
- analysis. You will be prompted to enter the INDEPENDENT VARIABLE(S),
- which in this case are DEFLATOR, GNP, UNEMP, ARMED, POP, TIME. Enter
- any combination of 1,2,3,4,5,6 to choose the variable(s) you wish to
- analyze against TOTAL. One way to approach a multiple regression
- problem is to first include all of the independent variables. After
- initial analysis (see below) you may decide to eliminate those
- independent variables found to not be significant.
-
- After entering the independent variables, you will be asked for the
- DEPENDENT VARIABLE. Enter 7, which chooses TOTAL. KWIKSTAT will now
- perform the calculations and display the results on the screen, as
- illustrated in Figure 4.16.
-
- EXAMPLE 4.22: CORRELATION MATRIX (LONGLEY DATA)
-
- Select the Correlation matrix option from the Regression and
- Correlation menu. You will be prompted to choose variables from the
- list of fields that appears. In this case, there are seven fields, and
- you can choose any combination of them. If you want correlation
- coefficients of all pairs of the seven variables, type 1,2,3,4,5,6,7
- and press Enter. KWIKSTAT will perform the calculations and display
- the 7 by 7 array shown in Figure 4.17. Only half of the array is
- displayed since the other half is a mirror image.
-
- The diagonal entries are also omitted since they are all one; a
- variable is always perfectly correlated with itself. Each entry in the
- --------------------------------------------------------------------
- Become a Registered User, Print Order Form
-
-
- 39
-
-
-
- KWIKSTAT CONDENSED MANUAL Version 3.3
- --------------------------------------------------------------------
-
-
- array consists of two numbers (three numbers if the information is
- viewed or printed to a printer). The first (upper) is the Pearson's
- correlation coefficient for the two (row and column) variables of that
- entry. The second (middle) number, in parentheses, is the p-value of
- the t-test for Ho: rho = 0 vs. Ha: rho <> 0. If you view the resuts,
- the third (bottom) number, in brackets, is the sample size, or number
- of paired observations used in the calculations.
-
- Both the correlation coefficient and the p-value are interpreted as they
- are for any correlation of two variables (see Example 4.21 above). In
- this array, for example, POP and TIME are highly correlated (r=0.994,
- p=0.00) but POP and ARMED are not (r= 0.364, p=0.17).
-
- EXAMPLE 4.23: GRAPHICAL CORRELATION MATRIX (LONGLEY DATA)
-
- From the Simple and Multiple Regression Choose Analysis Option menu,
- select Graphical Correlation matrix. You will be prompted to choose
- variables from the list of fields that appears. In this case, there
- are seven fields, and you can choose any combination of them. If you
- want correlation coefficients of all pairs of the seven variables,
- type 1,2,3,4,5,6,7 and press Enter. KWIKSTAT will perform the
- calculations and display the 7 by 7 array of scatterplots shown in
- Figure 4.18.
-
- These scatterplots are a visual way of examining the relationships
- between pairs of variables. It allows you to determine if a
- relationship exists between the variables, and allows you to see if
- that relationship is linear. The more highly correlated two variables
- are, the more tightly clustered about a straight line are the points
- on the scatterplot.
-
-
- USING FREQUENCY AND CROSSTABULATION PROCEDURES
-
- The Crosstabulations, Frequencies, Chi Square module performs analyses
- on categorical data, that is, data observed in categories, rather than
- measurement data. Previous examples using measurement data include
- weights of hogs, weights of people, heights of plants, numbers of
- handguns and homicides, and dollar amounts. If, rather than taking a
- measurement, a data observation involves identifying which of a set of
- categories the observation falls into, you are working with
- categorical data.
-
- For example, you may identify a person by sex, eye color or hair
- color. You may identify families, individuals or institutions by
- geographical region or socioeconomic status identified by levels
- 1,2,3,4,5. You may identify employees by level of job satisfaction,
- --------------------------------------------------------------------
- Become a Registered User, Print Order Form
-
-
- 40
-
-
-
- KWIKSTAT CONDENSED MANUAL Version 3.3
- --------------------------------------------------------------------
-
-
- where there are three or four levels to choose from.
-
- Generally, categorical data are entered into a database by using one
- record for each person or entity on which the observation is made and
- one field for each characteristic which is divided into categories.
- For example, to categorize ten people by sex, hair color and eye
- color, you would need ten records (one per person) and three fields
- (e.g., SEX, HAIR, EYE). Some of the procedures in this module give you
- the choice of simply entering totals for each category rather than
- creating a database and entering the results of each observation. This
- can save time if totals are known and only totals are needed to
- perform a test or calculation or to produce a graph.
-
- PERFORMING A FREQUENCIES ANALYSIS
-
- In the Frequencies option, KWIKSTAT "counts" the occurrence of each
- data value for a single variable or field and displays that
- information in a table. You can also create a bar chart, pictograph
- and/or pie chart of this information using this option.
-
- EXAMPLE 4.24: FREQUENCY TABLE, PICTOGRAPH, BAR AND PIE CHARTS
-
- This example uses the EXAMPLE database file on the KWIKSTAT disk. One of
- the fields (variables) in this database is STATUS referring to
- socioeconomic status. Suppose you want to know how the total data set
- is divided up into the five levels of STATUS. You also want to produce
- a visual display of this information. Open the EXAMPLE database.
-
- Performing the Analysis
-
- From the Frequencies and Crosstabulations Choose Analysis Option menu,
- select Frequencies, Pictograph, Pie Chart. You will be prompted to
- enter one field (variable) to use. Since you want to do a frequency
- table on STATUS, enter 7. KWIKSTAT will count the data in each of the
- five categories of STATUS and display the results as a frequency
- table.
-
- You are then prompted to press Enter, which takes you to the Frequencies
- Analysis menu. From this menu you may choose to go back and do another
- analysis, or create charts.
-
-
-
-
-
-
-
- --------------------------------------------------------------------
- Become a Registered User, Print Order Form
-
-
- 41
-
-
-
- KWIKSTAT CONDENSED MANUAL Version 3.3
- --------------------------------------------------------------------
-
-
- PERFORMING A GOODNESS OF FIT ANALYSIS
-
- A goodness-of-fit test of a single population is a test to determine if
- the distribution of observed frequencies in the sample data closely
- matches the expected number of occurrences under a hypothetical
- distribution of the population. The data observations must be independent
- and each data value can be counted in one and only one category. It is
- also assumed that the number of observations is fixed. The hypotheses
- being tested are
-
- Ho: The population distribution begin samples follows the hypothesized
- distribution.
- Ha: The population does not follow the hypothesized distribution.
-
- PERFORMING A CROSSTABULATION ANALYSIS (CHI-SQUARE)
-
- Crosstabulations can be used to perform a chi-square test for
- independence or a chi-square test for homogeneity. A two-way table is
- constructed that displays the number of counts for each category. It
- must be possible to assume that the data observations are independent
- and that each data value can be counted in one and only one category.
- It is also assumed that the number of observations is fixed. KWIKSTAT
- allows you to enter data for a two-way table from the keyboard or from
- a database. When you choose to enter the two-way table from the
- keyboard, KWIKSTAT will ask you the size of the table (number of rows
- and columns). A blank table will be presented on the screen, and you
- will then be prompted to enter a number in each cell of the table.
-
- If you choose to enter the information from a database, KWIKSTAT will
- prompt you to enter a list of tables to be calculated. For example '2 BY
- 3' specifies a tablulation of field 2 by field 3. The specification '1
- BY 2,3' specifies the tables 1 BY 2 and 1 BY 3. You may list up to ten
- field numbers on both sides of the BY. KWIKSTAT will read the information
- from the database, and construct the table or tables. For instance, in
- the EXAMPLE database, if you choose to tabulate the variables GROUP and
- STATUS, KWIKSTAT will form the table on the screen as illustrated in
- Figure 4.23. (Note that the first variable entered is the row
- variable.)
-
- When requesting multiple plots in the BY specification, you will be
- given the option to choose to pause after each table is displayed, or
- to continue non-stop and print all of the requested tables to the
- printer to to a file.
-
- For a test for independence, a contingency table looks at two
- categorical variables from a single sample of one population and tests
- whether the two variables are related in some way, (e.g., are sex and
- --------------------------------------------------------------------
- Become a Registered User, Print Order Form
-
-
- 42
-
-
-
- KWIKSTAT CONDENSED MANUAL Version 3.3
- --------------------------------------------------------------------
-
-
- hair color related?)
-
- The hypotheses being tested are:
-
- Ho: The variables are independent of each other. (There is no
- association between them).
-
- Ha: The variables are not independent of each other.
-
- KWIKSTAT reports both the chi-square statistic and the p-value. If the
- expected value (Eij) in one or more cells is less than 5, the
- chi-square test may not be valid. A warning to this effect appears on
- the screen if appropriate. In the case of a 2 by 2 table, Fisher's
- Exact Test and the chi-square with Yates' correction are also
- performed and results displayed.
-
- EXAMPLE 4.26: CROSSTABULATION ANALYSIS (2 BY 2) TEST FOR INDEPENDENCE
-
- Data for this example are observations of the number of beetles and
- bugs on the upper and lower sides of leaves (Zar,1974, page 292). 2 by
- 2 Contingency Table Data
-
- Beetles Bugs
- ---------------
- Upper Leaf 12 7
- Lower Leaf 2 8
-
- Since you are given only the totals for each of the four categories,
- and not the individual data for each leaf, there is no need to create
- a database. Rather, you can just enter these totals from the keyboard.
-
- The calculated chi-square statistic in this case is 4.89 with a
- p-value of 0.028. The chi-square with Yates correction is 3.31 with a
- p-value of 0.069 and the Fisher Exact Test (two tail) has a p-value of
- 0.050. Because one of the cells produces an expected value less than
- 5, KWIKSTAT gives a warning that the chi-square analysis for this data
- may not be valid. Given this warning, it is best to rely on the
- Fisher's Exact Test for making a decision. A decision can be made
- using the p-value of the test. A low p-value (less than or equal to
- the chosen significance level) is usually taken to indicate rejection
- of the null hypothesis. At a 0.05 significance level, the Fisher's
- Exact Test p-value of 0.050 indicates (on the borderline) that there
- is enough evidence to reject the null hypothesis of independence of
- the two variables and to conclude that leaf side and type of insect
- are not independent. In this case it appears that beetles prefer the
- upper sides of leaves and bugs are about split in their preference. In
- the case of the Yates results, this decision is marginal. After
- --------------------------------------------------------------------
- Become a Registered User, Print Order Form
-
-
- 43
-
-
-
- KWIKSTAT CONDENSED MANUAL Version 3.3
- --------------------------------------------------------------------
-
-
- viewing the crosstabulation results, press Enter and a
- Crosstabulations Analysis menu will appear. This menu gives you the
- options of doing another analysis, printing what you have done, or
- producing a 3-dimensional bar chart. Select by highlighting the
- desired option and pressing Enter.
-
- DRAWING A 3-D BAR CHART
-
- KWIKSTAT allows you to draw a 3-dimensional bar chart of data for a
- contingency table (crosstabulation), and then to focus in on a part of
- it if desired.
-
- Data for the 3-dimensional bar chart must be entered first, either
- from the keyboard or a database, by using the Crosstabulations,
- Chi-Square option of the Frequencies and Crosstabulations Choose
- Analysis Option menu. To get to this menu from the Data main menu,
- select Analyze at the top of the screen, and then select
- Crosstabulations, Frequencies, Chi Square.
-
- MCNEMAR'S TEST
-
- McNemar's test is appropriate for use with paried, dichotomous data.
- This test is sometimes called a test for related samples or a test for
- the significance of changes. It is used ehen the response is one of
- only two possible outcomes. McNemar';s test is the 2 by 2 version of
- Cochran's Q test described earlier. The test assumes that any pair of
- observations is independent of any other pair or observations,
- although clearly the observations with a pair are not independent of
- each other.
-
- USING LIFE TABLES AND SURVIVAL ANALYSIS PROCEDURES
-
- As the name indicates, this module performs life tables and survival
- analysis procedures. The data must be in the following form:
-
- 1) a TIME variable which contains a time (e.g., minutes, days, years,
- etc.) in which the subject or component has been observed to be alive
- (not failed).
-
- 2) a CENSOR variable which must take on the values 0 or 1, where 1
- means the subject has died (failed), and a 0 means the subject was
- still alive (not failed) at the last available time period.
-
- 3) optionally, a GROUPING variable which may have up to ten values
- (numeric or character), i.e., the data may be in groups.
-
- Once the data are entered into the program, a life table for each
- --------------------------------------------------------------------
- Become a Registered User, Print Order Form
-
-
- 44
-
-
-
- KWIKSTAT CONDENSED MANUAL Version 3.3
- --------------------------------------------------------------------
-
-
- group is produced which includes, for each time interval, the number
- entered, withdrawn, lost, dead, exposed, the proportion dead,
- proportion surviving, cumulative proportion surviving, hazard and
- density. A plot is given for the cumulative proportion surviving in
- the group(s) against time. If more than one group is entered, a
- Mantel-Haenszel test is performed to test the hypothesis of equal
- survival patterns for the groups.
-
- A small version of the survival plot will appear on the screen, and if
- you choose to print a report of the session the report will include a
- larger version of the plot along with other information from the
- analysis.
-
- PERFORMING A LIFE TABLE ANALYSIS
-
- Survival analysis is used to summarize information in life tables, to
- examine survival trends over time, and to compare survival times
- between groups.
-
- EXAMPLE 4.31A: LIFE TABLE ANALYSIS
-
- The data for this example are in the LIFE database on the KWIKSTAT
- disk. These data are from Prentice (1973).
-
- Open the database named LIFE. The LIFE database consists of 3 fields:
- SURVIVAL, CENSOR, and GROUP. Figure 4.30 displays a portion of the
- LIFE database. You can get this display by using the List (display)
- the contents of a database option on the Data main menu.
-
- The first column is the SURVIVAL field with entries of length of life, or
- length of survival. The second column is the CENSOR field, an indicator
- of whether the subject has failed (died) or not at the last observed time
- period. 1 means failed, 0 means not failed (still alive).
-
- The third column contains a grouping variable. In this case it is either
- 1 or 2. Group 1 may represent one treatment, while group 2 represents
- another kind of treatment. The objective is to compute survival curves to
- see if the treatments provide different average survival distributions.
-
-
-
-
-
-
-
-
-
- --------------------------------------------------------------------
- Become a Registered User, Print Order Form
-
-
- 45
-
-
-
- KWIKSTAT CONDENSED MANUAL Version 3.3
- --------------------------------------------------------------------
-
-
- APPENDIX
-
- INTERPRETING ERROR CODES
-
- If the program encounters a problem it does not know how to resolve,
- it will usually display an error message. This message will contain an
- error code and a reference code. Many times, you can correct this
- error situation by understanding what caused it. For example, if you
- were to get an error number 27, you would know that it was caused by
- your printer sending an "Out of Paper" message to the program. If you
- are unable to resolve the problem, write down the steps taken before
- the error occured, and send it to TexaSoft on the Problem Report Form.
- We will try to resolve the problem as quickly as possible.
-
- Error Codes:
- Error Number 5 = Illegal function call
- Error Number 6 = Overflow
- Error Number 7 = Out of Memory
- Error Number 9 = Subscript out of range
- Error Number 11 = Division by zero
- Error Number 14 = Out of String Space
- Error Number 24 = Device Timeout
- Error Number 25 = Device fault
- Error Number 27 = Out of Paper
- Error Number 50 = FIELD overflow
- Error Number 51 = Internal Error
- Error Number 52 = Bad filename or number
- Error Number 53 = File not found
- Error Number 54 = Bad file mode
- Error Number 55 = File already open
- Error Number 57 = Device I/O error
- Error Number 58 = File already exists
- Error Number 61 = Disk full
- Error Number 62 = Input past end of file
- Error Number 63 = Bad record number
- Error Number 64 = Bad filename
- Error Number 67 = Too many files
- Error Number 68 = Device unavailable
- Error Number 70 = Permission denied
- Error Number 71 = Disk not ready
- Error Number 72 = Disk media error
- Error Number 74 = Rename across disks
- Error Number 75 = Path/File access error
- Error Number 76 = Path not found
- Error Number 81 = Invalid filename
-
-
- --------------------------------------------------------------------
- Become a Registered User, Print Order Form
-
-
- 46
-
-
-
- KWIKSTAT CONDENSED MANUAL Version 3.3
- --------------------------------------------------------------------
-
-
- Problem Report form: KWIKSTAT
-
- Please explain in detail the problem that occurred. If possible,
- send a print out of the results or Print Screen.
- KWIKSTAT VERSION YOU ARE USING:________________________
- KWIKSTAT MODULE where problem occurred:____________________
- YOUR COMPUTER: BRAND/Model_____________________________
- MONITOR TYPE:________AMOUNT OF MEMORY:_______________
- VERSION OF DOS YOU ARE USING:____________________________
- MEMORY RESIDENT PROGRAMS YOU USE:____________________
- PROBLEM:
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
- Mail to:TexaSoft, P.O. Box 1169, Cedar Hill, Texas 75104. Or fax to
- 214-291-3400, or send E-Mail to Compuserve 70721,3145.
-
-
-
-
-
-
-
-
-
- --------------------------------------------------------------------
- Become a Registered User, Print Order Form
-
-
- 47
-
-
-
- KWIKSTAT CONDENSED MANUAL Version 3.3
- --------------------------------------------------------------------
-
-
- USER'S BALLOT
-
- Please indicate your preference for improvements in KWIKSTAT. On a
- scale of 0 to 10
- 0 = Very Low priority for this change
- 10 = Very High priority for this change
-
- Vote Proposed item of change
- ---- -----------------------------------------------------
- ____ More "BY GROUP" capabilities
- ____ Ability to sort database
- ____ Add more ANOVA types
- ____ Add more Non-parametric statistical tests
- ____ Add General Linear Model
- ____ Make Report more flexible
- ____ Add Quality Control Module
- ____ Speed up program functions
- ____ Add more graphics, what kind?
- ____ Improve graphic quality
- ____ Add cluster analysis
- ____ Add discriminant analysis
- ____ Automate analysis from a command file
- ____ _____________________________________________
- ____ _____________________________________________
- Other Comments:
-
- Mail to:TexaSoft, P.O. Box 1169, Cedar Hill, Texas 75104. Or fax to
- 214-291-3400, or send E-Mail to Compuserve 70721,3145.
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
- --------------------------------------------------------------------
- Become a Registered User, Print Order Form
-
-
- 48
-
-
-
- KWIKSTAT CONDENSED MANUAL Version 3.3
- --------------------------------------------------------------------
-
-
-
- S H A R E W A R E
- _________________
-
- TRY IT BEFORE YOU BUY IT
-
- The purpose of shareware products is to allow you to try software
- products before you buy them. KWIKSTAT is not a public domain
- program. Persons who use KWIKSTAT on a regular basis should purchase a
- copy. You receive several benefits from becoming an official
- registered user:
-
- 1. You help to keep the product growing to meet your needs.
-
- 2. You receive the very latest version, with a printed, bound, and
- expanded manual.
-
- 3. You receive periodic newsletters announcing new releases, and
- pointing out important information on any bugs and fixes.
-
- 4. You are able to purchase update to new versions for a minimal cost.
-
- Print the file on disk named KSORDER.TXT to register. Thanks.
-
- ⁄ƒƒƒƒƒƒƒƒƒø
- ⁄ƒƒƒƒƒ¡ƒƒƒø ≥ (tm)
- ƒƒ≥ ≥o ≥ƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒ
- ≥ ⁄ƒƒƒƒƒ¡–ƒƒø ≥ Association of
- ≥ ≥ ≥ƒŸ Shareware
- ¿ƒƒƒ≥ o ≥ Professionals
- ƒƒƒƒƒƒ≥ ∫ ≥ƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒ
- ¿ƒƒƒƒ–ƒƒƒƒŸ MEMBER
-
- This program is produced by a member of the Association of Shareware
- Professionals (ASP). ASP wants to make sure that the shareware
- principle works for you. If you are unable to resolve a shareware-
- related problem with an ASP member by contacting the member directly,
- ASP may be able to help. The ASP Ombudsman can help you resolve a
- dispute or problem with an ASP member, but does not provide technical
- support for members' products. Please write to the ASP Ombudsman at
- 545 Grover Road, Muskegon, MI 49442 or send a CompuServe message via
- CompuServe Mail to ASP Ombudsman 70007,3536.
-
-
-
-
-
-
-
-
-
- 49
-
-